arXiv:1504.03632v2 [cs.IT] 26Jul2015 


Caching with Unknown Popularity Profiles in Small 

Cell Networks 


Bharath Bettagere Nagaraja 
Department of ECE, 
PESIT-Bangalore South Campus, 
Bangalore, India. 

Email: bharathbn@pes.edu 


Kyatsandra G. Nagananda 
Department of ECE, 

PES University, 
Bangalore, India. 

Email: kgnagananda@pes.edu 


Abstract —A heterogenous network is considered where the 
base stations (BSs), small base stations (SBSs) and users are 
distributed according to independent Poisson point processes 
(PPPs). We let the SBS nodes to posses high storage capacity 
and are assumed to form a distributed caching network. Popular 
data files are stored in the local cache of SBS, so that users can 
download the desired files from one of the SBS in the vicinity 
subject to availability. The offloadlng-loss is captured via a cost 
function that depends on a random caching strategy proposed in 
this paper. The cost function depends on the popularity profile, 
which is, in general, unknown. In this work, the popularity profile 
is estimated at the BS using the available instantaneous demands 
from the users in a time interval [0, r]. This is then used to find 
an estimate of the cost function from which the optimal random 
caching strategy is devised. The main results of this work are the 
following: First it is shown that the waiting time r to achieve an 
£ > 0 difference between the achieved and optimal costs is finite, 
provided the user density is greater than a predefined threshold. 
In this case, r is shown to scale as N^, where N is the support 
of the popularity profile. Secondly, a transfer learning-based 
approach is proposed to obtain an estimate of the popularity 
profile used to compute the empirical cost function. A condition 
is derived under which the proposed transfer learning-based 
approach performs better than the random caching strategy. 

I. Introduction 

The advent of multimedia-capable devices such as smart 
phones at economical costs has triggered the growth of mobile 
data traffic at an unprecedented rate. Eor instance, com¬ 
putationally intensive applications like content-based video 
analysis alone contributes a large fraction of the total mobile 
data traffic. This trend is likely to continue at an alarming 
pace, demanding wireless service providers to reevaluate the 
design strategies for the next generation wireless infrastructure 
- providing high data rates for tens of thousands of users, 
improving the spectral efficiency beyond what is currently 
available, improving coverage and cell-capacity, and engineer¬ 
ing efficient inter-device signalling techniques IT]. 

A promising approach to address the wireless infrastructure 
problem is to deploy small cells which can offload a significant 
amount of data from the macro BS Research on next gen¬ 
eration wireless infrastructure has primarily focussed on the 
design and implementation of such small cells El. However, 
a shortcoming of small cell-based architectures is that, during 
peak traffic hours, the backhaul-link capacity requirement to 
support data traffic is enormously high El. At the same 


time, the cost incurred in designing high capacity backbone 
network for small cells is quite discouraging especially from 
an economic standpoint. Thus, in the grand scheme of things, 
small cell-based solutions alone will not suffice to efficiently 
address the quality of service requirements associated with 
future traffic demands. Therefore, one is inspired to further 
investigate the advantages offered by heterogenous networks. 

A recent development in this direction is to improve the 
accessibility of data content to users by storing the most 
popular data files in the local caches of small cell BSs, 
with the objective of reducing the peak traffic rates 0 - 
0. This is commonly referred to as “caching” and has 
attracted significant attention. Eor instance, in 0, it was 
shown that, when cached-content demand was uniformly dis¬ 
tributed, joint optimization of caching and coded multicast 
delivery significantly improved the caching gains; this was 
extended to a more general decentralized setting in JTOl . The 
problem setup considered in 0 was extended to nonuniform 
distribution on cached-content demand in ifTTl . The tradeoff 
between the performance gain of coded caching and delivery 
delay in computationally intensive applications such as video 
streaming was characterized in na, ini; while, in m, 
coded caching was achieved for content delivery networks with 
two layers of caches. A polynomial-time heuristic solution 
was proposed in na to address the NP-hard optimization 
problem of maximizing the caching utility of users based on 
the mobility patterns of users. 

Caching has also made advances in device-to-device (D2D) 
communications. In CD, the outage-throughput tradeoff was 
characterized for D2D nodes having access to cached files in 
a library, but obtained the requested file from a third-party 
via a D2D link. In iflTl . the conflict between collaboration- 
distance and interference was identified among D2D nodes to 
maximize frequency reuse by exploiting distributed storage of 
cached content in those nodes. In ifTsl . coded caching was 
shown to achieve multicast gain in a D2D network, where 
users had access to linear combinations of packets from cached 
files. The NP-hard nature of distributed caching was reported 
in EH, where approximation algorithms were proposed for 
video content delivery. In 1201 . the throughput scaling laws 
of random caching, where users with pre-cached information 
made arbitrary requests for cached files, was studied. 


A. Main contributions of this paper 

In the above cited references, the popularity profile of data 
files was assumed to be known perfectly. However, in practical 
applications, this turns out to be a very weak assumption. In 
this paper, we relax the assumption on the knowledge of the 
popularity profile to devise a caching strategy. 

We consider a heterogenous network where the users, BS 
and SBSs are assumed to be distributed according to inde¬ 
pendent PPPs. Each SBS is assumed to employ a random 
caching strategy with no caching at the user terminal (see 
my A protocol model for communications is proposed using 
which a cost, which captures the backhaul-link overhead that 
depends on the popularity profile, is derived (Section |II|i. 
Secondly, assuming a Poisson request model, a centralized 
approach is presented in which BS computes an estimate 
of the popularity profile; this estimate is then used in the 
cost function to optimize the caching probability. Thus, the 
actual cost incurred differs from the optimal cost, and the 
difference depends on the number of samples used to estimate 
the popularity profile. Furthermore, the number of samples 
collected at BS is related to the density of the Poisson arrival 
process and the waiting time during which the samples are 
collected. We derive a lower bound on this waiting time which 
guarantees a cost that is within e > 0 of the optimal cost (this 
is derived in Theorem |2] Section Ull-Ab . 

The following are the main findings of our study: 

1) The waiting time is finite, provided the user density is 
greater than a threshold. 

2) The waiting time scales as N'^, where N is the total number 
of cached data files in the system. 

3) These results are improved by using a transfer learning 
(TL)-based approach to estimate the popularity profile 
wherein samples from other domains such as those ob¬ 
tained from a social network are used to improve the esti¬ 
mation accuracy. The minimum number of source domain 
samples required to achieve a better performance is derived. 
Employing the TL-based approach, a finite waiting time is 
achieved for all user density. 

4) In the TL-based approach, the waiting time is a function of 
the “distance” between the probability distribution of the 
files requested and the distribution of the source domain 
samples (the notion of distance is made precise in the proof 
of Theorem 12 Section Ull-Bb . 

Our work draws parallels with those reported in 11211 - ll23l . 
where a learning-based approach was used in which the 
popularity profile was estimated to devise the caching strategy. 
We emphasize that, unlike llHI - ll23l . this paper focuses on 
theoretical analysis of time and sample complexities to achieve 
the desired performance. To the best of our knowledge, this is 
first instance where an analytical treatment of waiting time and 
its relation to the probability distribution function of source 
domain samples has appeared in the literature on caching. 

The rest of the paper is organized as follows. In Section HU 
the system model is first presented followed by a communica¬ 
tions protocol for caching. The two methods for estimating the 


popularity profile and its corresponding waiting time analysis 
are developed in Section |III] Concluding remarks are provided 
in Section |IV] The proofs of main theorems of the paper are 
relegated to appendices. 

II. System Model and Communications Protocol 

A heterogenous cellular network is considered where the 
set of users (denoted <l>u C R^), the set of BSs (denoted 
C R^), and the set of SBSs (denoted C R^) are 
distributed according to independent PPP with density A^, Xb 
and Xu, respectively ll24l . Each user independently requests 
for a data file of size B bits in = {/i, f 2 , ■ ■ ■, In} with 
popularity V = {pi,... ,pn}, ^^d is assumed 

to be stationery across time. In a typical heterogenous cellular 
network, BS fetches the file using its backhaul link to serve 
the user. During peak data traffic hours, this results in an 
information-bottleneck both at BS as well as in its backhaul 
link. To overcome this problem, caching of the most popular 
files (either at the user nodes or at SBSs) is proposed - the 
requested file will be served directly by one of the neighboring 
SBS depending on the availability of the file in its local cache. 
The performance of caching depends on the density of SBS 
nodes, the cache size, the users’ request rate, and the caching 
strategy. It is assumed that the SBS can cache up to M files, 
each of length B bits. Each SBS s € <l>s caches its content in 
an i.i.d. manner by generating M indices distributed according 
to n = {iTi : fi e T}, TTi = 1 (see |[T6l). 

The following communication protocol is used in this paper 
which determines the set of neighbor SBS nodes for any user 
u. Each SBS s at Xg € communicates with a user u at 
Xu G *511 if —a^sll < 7 , (7 > 0 ); this condition determines 
the communication radius. Note that, we have ignored the 
interference constraint. The set of potential neighbors of user 
u at Xu is denoted Afu — {y & '■ \\y — a:u|| < 7 }- 

The performance of the caching scheme depends on the met¬ 
ric used to optimize the strategy. We focus on the performance 
of a typical user located at the origin. Since the main goal is 
to reduce the information-bottleneck at BS, the objective is to 
minimize the time overhead due to the unavailability of the 
requested file. The following “offloading loss” metric is used 
to optimize the caching strategy proposed in this paper: 

r{u,v)^^E[i{f,(^Afu}], (1) 

Kq 

which captures the unavailability of the requested file from a 
typical user at the origin. Rq is the rate supported by BS to 
the user at the origin, and is the time overhead incurred in 
transmitting the file from BS to the user. The expectation is 
with respect to and V. The indicator function 1{A} 

is equal to one if event A occurs, and zero otherwise. Better 
performance can be achieved by minimizing T(n,7^), i.e., 

min r(n, V) (2) 

n^o 

N 

subject to = 1, TTi > 0, 

i=l 






for i = 1,..., To solve the optimization setup (|2]), we need 
an expression for T(n, V), which is derived in the following. 

Theorem 1: For the caching strategy proposed in this paper, 
the average offloading loss is given by 


r{n,v) 


B 

Ro 


■ N 

exp{-Au7r7^(l - Tr^)^}pi 


(3) 


Proof: See Appendix lAl ■ 

In general, the optimization problem (|2]i is non-convex. 
More importantly, the popularity profile V is unknown, and 
is estimated from the available data. Denoting the estimated 
popularity profile by "P = {pi,... ,pjv} and the corresponding 
offloading loss by T(n, P), problem (|2]) can be reformulated 
as follows: 


min r(n,P) (4) 

n^o 

N 

subject to^^ TTi = 1. 

i=l 

The solution to the reformulated problem (|4]i differs from 
that of the original problem (|2]i. Let II* and 11* denote the 
optimal solutions to the problems in ([2]) and (|4]i, respectively, 
and let the throughput achieved using II* be denoted T* = 
T(II*,P). The central theme of this paper is the analysis of 
the offloading loss difference where T* = 

Theorem|2]and Theorem[3are given over to this analysis. We 
assume the minimization problems (| 2 ]) and (IHi can be solved 
in polynomial time. 


III. Learning Complexity 

In this section, we study two quantities of interest, namely, 
the time complexity involved in obtaining the samples and 
estimating the empirical risk to achieve a performance within 
e > 0 of the optimal solution. The efficiency of the estimate V 
of the popularity profile depends on the number of available 
samples, which in turn is related to the number of requests 
made by the users. We first define the user-request model. 

Definition 1: (Request Model) Each user requests for a file 
/ S P at a random time t G [0, cxd] following an independent 
Poisson arrival process with density Xr > 0. 

For notational convenience, the same density is assumed 
across all the users. A centralized scheme is used where BS 
collects the requests from all the users in its coverage area in 
the time interval [0, r], to estimate the popularity profile of the 
requested files. Let the number of users in the coverage area 
of BS 6 S of radius i? > 0 be n/j, which is distributed 
according to a PPP with density A„. Let the number of requests 
made by the user u € {$„ fj B(0, i?)} in the time interval 
[0, r] be ku, where B(0, R) is a two-dimensional ball of radius 
R centered at 0. We assume that requests across the users 
are known at BS. The requests from the user u is denoted 
Xu = {X ^\..., where X^'' G {I,..., A^} denotes 

the indices of the files in / = 0,..., ku- After receiving Xu, 


u G {<I>u fj B(0, i?)}, in the time interval [0,t], BS computes 
an estimate of the popularity profile as follows: 


1 


Pi = 


E. 


h 


Y. nxL‘> = i]. 


ixG{B(o,it)n*u} “ i=o 


(5) 

i = 1,..., N. Given the number ur of users in the coverage 
area of BS, the term ^^€{ 8 ( 0 .® ^ 

density nnXr- Also, E {pi| QB(0, i?)}| = n^} = pi, 
from which we conclude that pi is an unbiased estimator. 
The estimated popularity profile pi is shared with every SBS 
in the coverage area of BS, and then used in © to And 
the optimal caching probability. The derived estimator can be 
improved by using samples from other related domains, for 
example, a social network. The term “target domain” is used 
when samples are obtained only from users in the coverage of 
BS. Next, we derive the minimum waiting time r required to 
achieve the desired accuracy of e. 


A. Target domain-only learning 

In this subsection, we derive a lower bound on the waiting 
T corresponding to the estimator in Q. 

Theorem 2: For any e > 0, with a probability of at least 
1 — J, an offloading loss of T* < T* -f e can be achieved 
using the estimate in © provided 


T > 



oo 


if A„ > C, 
otherwise, 


( 6 ) 


where {a:}+ = max{a::, 0}, p* = (I — exp{—2e^}), C = 
dt^log(^) and 


_ ^_ fipe _ 

2BsupnE*=iff(7ri)’ 


(7) 


with g{TTi) = exp{-Au 7 r 7 ^(I - 
Proof : See Appendix iBl ■ 

To achieve a finite waiting time that results in an accuracy 
of e > 0 , the user density Xu has to be greater than a threshold. 
Further insights into © are obtained by making the following 
approximation: 1 — x < e~^ for all a: > 0. This combined 
with supn:n:^o,i'^n=i ^ X yields the following 

lower bound on r: 


T > 


2B^ 

irR^XuXrR^e^ 


log 


A few observations are in order: 



( 8 ) 


(1) The waiting time r to achieve an e-offloading loss differ¬ 
ence scales as A^^; 

(2) T is inversely proportional to (A„, Ar); 

(3) as the radius of coverage increases, the delay decreases as 

and 

(4) as the file size B increases, the waiting time scales as B^. 
The above result is a lower bound on the waiting time per 
request per user, since the offloading loss is derived for a given 
request per user. There are on an average Xr requests per unit 









time per user. Thus, to obtain the waiting time per user, the 
offloading loss has to be multiplied by A^. This amounts to 
replacing e by e/Ar, resulting in the following lower bound; 


T > 


2B^Xr 

7ri?2Aiii?ge^ 


log 



(9) 


It is clear that the waiting time scales linearly with A^. 
Although the waiting time per user per request tends to zero 
as Xr oo, the waiting time per user tends to oo. This 
is because the number of requests per unit time approaches 
oo, and thus, a small fraction of errors results in an infinite 
difference in offloading loss leading to an infinite waiting time. 
In the following subsection, a TL-based approach is devised 
to improve the waiting time. 


B. Transfer learning to improve the waiting time 

In general, the minimum waiting time required to achieve 
an accuracy of e > 0 can be very large. A recent approach to 
alleviate this shortcoming is to utilize the knowledge gained 
from users’ interactions with a social community (termed 
“source domain”). Specifically, by cleverly combining samples 
from the source domain and users’ request pattern (i.e., the 
target domain), one can potentially reduce the waiting time. 
However, the accuracy is indicative of the dependency between 
the source and target domains. Thus, TL-based approach 
and its impact on the time complexity to achieve a given 
accuracy is of paramount importance. TL-based approach was 
also employed in ll25l to negotiate over-fitting problems in 
estimating the content popularity profile matrix. However, in 
this paper, we are interested in finding the minimum waiting 
time to achieve a desired performance accuracy. Furthermore, 
the model considered in this paper is different from the one in 
II25I . The TL-based approach considered here comprises two 
sources from which the samples are drawn, namely, source 
domain and target domain. 

An estimate of the popularity profile is obtained as follows. 
First, using target domain samples, the following parameter is 
estimated at BS 

E (10) 

i = 1,... ,N, similar to 0. We denote the source domain 
samples = {X^,... ,X^}, which is drawn i.i.d from a 
distribution Q. Here, X^ = i{i = 1,... ,N) denotes that the 
user corresponding to the Z* sample has requested the file fi. 
Using this, BS computes 

m 

= * = (11) 


Now, BS uses (fTOl t and (HB to compute an estimate of 
(the superscript (tl) indicates transfer learning) given by 


^(tZ) A 

Pi = 


+ Sf 


S«G{B(0,fl) n 'I’u} 


( 12 ) 


Using the estimate (fT2l) . we provide a bound on the time 
complexity in the following theorem. 

Theorem 3: For any accuracy 

2 Bsupn 3(7^0} 

e>-- HV-Q\\h, (13) 

no 

with a probability of at least 1 — 5, an offloading loss T* < 
T* + e can be achieved using the estimate in (fTB provided 

T>| { v(i-A--Z.) ‘°«(A)} ifA.>A „4, 
[ 00 otherwise, 


where p = ^ 


^ (log^ -2e2^m), epq = e- \\V- 
1 


m. 


A4 


XuT^B? 


. 2W „ 2 

log — - 2e^ 


pq 


m 


(15) 


' - 2asupn|E.Lg(^0} ’ - exp{-A„7r72(l - 

and \\V - Q\\n = sup^gji^^j \q, - pi\. 

Proof: See Appendix ICl ■ 

From Theorem [2 we see that under suitable conditions the 
TL-based approach performs better than the source domain 
sample-based agnostic approach. The following inferences are 
drawn; 


(1) The minimum user density to achieve a finite delay is 

reduced by a positive offset of In fact, for m > 

log {m) 

2(£-[|77-E||w) ’ ^ delay can be achieved for all user 
density which provides significant advantage. 

(2) The finite delay achieved is smaller compared to source 
domain sample-based agnostic approach for large enough 
source samples, and the distributions are “close.” This is 
made precise in the following proposition. 

Proposition 1: For any e > 0 and 5 G [0,1], the TL- 
based approach performs better than the source sample-based 
agnostic approach provided the number of source samples 
satisfies 


m > 


1 

2e^ 




(16) 


and the distributions satisfy the following condition 


where 


F 


1 — e 


= XuT^F? ( ^ ~ I 1— 


-2£" 




(l-L) 


(17) 


(18) 


and C = 


Au 7 ri ?2 


iog(¥)- 


IV. Concluding Remarks 


This paper considered the problem of caching in a dis¬ 
tributed heterogenous cellular network, when the popularity 
profiles of cached data files were unknown. A heterogenous 
network was considered where BS, SBSs and users were 
distributed according to independent PPPs. SBSs stored high 















data-rate content, which could be downloaded directly by one 
of the users from a SBS in the vicinity. A metric was proposed 
that captured the offloading-loss, which was used to optimize 
a random caching strategy. This metric was shown to be a 
function of the popularity profile (assumed unknown). 

The popularity profile was estimated at BS using the avail¬ 
able instantaneous demands from users in a time interval [0, r]. 
We showed that a waiting time r to achieve an e > 0 difference 
between the achieved cost and the optimal cost was finite, 
provided the user density was greater than a threshold. In 
this case, r was shown to scale as N'^, where N was the 
support of the popularity profile. A TL-based approach was 
proposed to estimate the popularity profile, which was then 
used to compute the empirical cost. A condition was derived 
under which the TL-based approach performed better than the 
random caching strategy. 

Although TL-based approach performs better, the improve¬ 
ment is not without a price. The error that is achieved in (fTSl l 
depends on WV — Q\\u^ suggesting that lower the distance 
between the two distributions better the TL scheme performs. 
If \\V - Q\\n = 0, the TL-based approach performs signifi¬ 
cantly better than the other proposed scheme. However, from 
Proposition [T] the benefits of using target domain samples can 
only be realized with the knowledge of the distance ||P — Q||?i. 
In practice, an accurate estimate of this distance is difficult to 
obtain. However, there could be hope of finding an efficient 
estimate of ||P - Q\\h, and this is relegated to future work. 

Other potential avenues for future research for the problem 
setup considered in this paper include; 

(1) The popularity profile was assumed to be constant across 
users and stationary in time, which, in practice, may not be 
justifiable. Analysis of the performance of caching strate¬ 
gies by relaxing these assumptions merits investigation. 

(2) Characterizing the performance of TL-based caching when 
the popularity profile is estimated using linear combi¬ 
nations of source domain and target domain samples is 
another interesting topic for research. 
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Appendix A 
Proof of Theorem[T] 

Consider the first term in ([T]i 

El{/, i Mu} = E„, Pr{/, i Mu\ IV.I = nj 


*'= E„^ [Pr{/i ^ a, for some a e 



N 

= ^exp{-if}pi, (19) 

i^l 

where U = As7r7^ [l — (1 — TTi)^]. In the above exposition, 
(a) follows from the fact that the random caching proposed in 



this paper is independent across users, (6) follows since the 
cache size is fixed to be M, and (c) follows due to the fact 
that Us is a PPP with mean As7r7^ since Ug is the number of 
SBSs in a circular area of radius 7 . ■ 

Appendix B 
Proof of Theorem[2] 

First, for any e > 0, the following inequality is proved 
Pr{r* > r* + e} < Pr J 2 sup \AT\ > e} 


where AT = Till, V) - T{n, V). We have, 

f*-r* = f*-miTiIl,V) 

n 


< T* - T + sup 
n 


< 2 sup 

n 


Tin, V) - Tin, V) 
Tin,v) - Tin,v) 


{ N > 

sup^5(7ri)4 > e 


< Pr 


max (5r, > e 

i=l,2,...,7V ^ 


floe f A 
2B ’ ^ 


27VE^exp{-p} 


iXrriRTy 


n—0 


T > 


■log 




Appendix C 
Proof of Theorem[3] 

As in the proof of Theorem |2l it is easy to see that 


Pr{r* >T* + €}<Py\ sup 

' l<i<N 


4tl) 

Pi - Pt 


> e 


where e = 




(26) 

, and gi-Ki) = 


( 20 ) 


2 Bsupn{E^i gTi)}" 

exp{—A„7r7^(l — iTi)^}. Denote by Up the total number of 
requests in the coverage area of BS. As mentioned previously, 
conditioned on the number of users ur in the coverage area 
of BS, Up is a Poisson distributed random variable with 

» -I-. , -rr-. I 1 /KM 

Ti- 


density urT- Further, 
Using this, we can write 


_ ' tp 


rip+m-Pt I np+m'= 


( 21 ) 


where T = T(n,'P), thus proving ( |20] |. Now, 

consider the right hand side of (l20l i after substituting 
for Tin,P) and T(n,'P) from Q to get 
Prjsupn Ylf=i9i'^i)iPi - Pi) > e}/ which can be upper 
bounded as follows; 


Pr sup 

< Pr sup 

U<j<N 

< Pr < sup 

I l<i<N 

< Pr < sup 

I l<j<N 


pf +Epf^ -p. 


> e 


pfl -Epf^ + Epf^ -p. 


> e 


(27) 


p\ - Epl 


pf) _ Epf) 


> e — sup 
ie[i,Ar] 


> e - 


Up + m 


'^pf^ - Pi 


WV-QWn 


< Pr sup 

^l<i<N 


< > e} 

i=l 

< 2 NE [ eicp {- 2 pnp ]] (22) 


pI - 

< WE„^Pr{|pf)-Epf^ 


>e-\\V-Q\\n\np 

> e- WV - Q\\n I , 

provided e > HT* — QH-h- Now, employing Hoeffding’s 
inequality, we can write 


supn{EiIi gTi)}’ 


where Sp = \pi - Pi\, e = 

and giiTi) = exp{— Au7r7^(l — iTi)^}. The last inequality 
above follows by applying Hoeffdings inequality (see 1261), 
since the estimator V is unbiased and tt^, tt G [0,1]. The 
expectation in (l22li is with respect to Up. Conditioned on the 
number of users ur in the coverage area of BS, Up is a 
Poisson distributed random variable with density URXrT. We 
let g = iyPk + XtUrt), and p* = (l — exp {—2e^}). Using 
this, we get 


00 h. 


2WE„p exp {-2elginp + m)} 

= 2WE„j,exp{-(2e^^m +Aprii^r)} ^ 

= 2Nexp{-2elgm}E„^exp{-gpq} 

= 2N exp [—2epgm} exp{—A„7ri?^} x 

00 

exp{-ApZr(l - exp{-2e2 J)} 


iXuTTR'^y 


1=0 


= 2NEn^ expl-XrURTg*}. 

(23) 


This can be further simplified as 

exp{-XrkTg*} exp{-A^7ri7^} ^ = 

2A^exp{— A„7ri?^ (1 — exp{— Ar-rp*}) . (24) 

Now, it is easy to see that Pr {supjj \AT\ > f} < <) if the 
expression in (l24l is upper bounded by 5, which implies that 


where a = A^nfiTexp {-2e2^}, epq = e - WV - QWn and 
gpq = XrriRTil — exp{—2epq}). Therefore, we have 

2A7E„p exp {-2ep^(np + m)} 

= 2exp {—2epgm} exp{—Au7ri?^f}, (28) 

where f = (l — exp{—A^r (l — exp{—26^^})}). The above 
is at most <5 > 0 if 


T > 


1 


■log 


1 


(25) 


provided A^ > log 2^, otherwise r = 00 . This completes 
the proof of the theorem. ■ 


A,(l-e \^l-^^(log2f-2e2^m)y 

provided 

A« > ^ (log X ■ 

otherwise r = 00 . This completes the proof of Theorem [3 ■ 









































